Implementation and Evaluation of a Negation Tagger in a Pipeline-based System for Information Extraction from Pathology Reports

Authors

  • Kevin J. Mitchell
  • Michael J. Becich
  • Jules J. Berman
  • Wendy W. Chapman
  • John R. Gilbertson
  • Dilip Gupta
  • James Harrison
  • Elizabeth Legowski
  • Rebecca S. Crowley
Abstract

We have developed a pipeline-based system for automated annotation of Surgical Pathology Reports with UMLS terms that builds on GATE--an open-source architecture for language engineering. The system includes a module for detecting and annotating negated concepts, which implements the NegEx algorithm--an algorithm originally described for use in discharge summaries and radiology reports. We describe the implementation of the system, and early evaluation of the Negation Tagger. Our results are encouraging. In the key Final Diagnosis section, with almost no modification of the algorithm or phrase lists, the system performs with precision of 0.84 and recall of 0.80 against a gold-standard corpus of negation annotations, created by modified Delphi technique by a panel of pathologists. Further work will focus on refining the Negation Tagger and UMLS Tagger and adding additional processing resources for annotating free-text pathology reports.
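
As a rough illustration of the two ideas in the abstract (trigger-based negation detection and scoring against a gold standard), here is a minimal Python sketch. It is not the authors' GATE-based Negation Tagger. The trigger list is a hypothetical, heavily abbreviated stand-in for the NegEx phrase lists, only pre-negation triggers are handled, and the five-token scope (`WINDOW`) is an assumption made for this example. The function names `negated_concepts` and `precision_recall` are likewise invented here.

```python
import re

# Hypothetical, abbreviated trigger list (assumption); real NegEx lists are much longer.
PRE_NEGATION = ["no evidence of", "negative for", "without", "no"]
WINDOW = 5  # assumed scope: number of tokens examined after a trigger


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def negated_concepts(sentence, concepts):
    """Return the concepts found within WINDOW tokens after a negation trigger."""
    tokens = tokenize(sentence)
    # Try longer triggers first so "no evidence of" wins over plain "no".
    triggers = sorted((tokenize(p) for p in PRE_NEGATION), key=len, reverse=True)
    negated = set()
    i = 0
    while i < len(tokens):
        match = next((t for t in triggers if tokens[i:i + len(t)] == t), None)
        if match is None:
            i += 1
            continue
        start = i + len(match)
        scope = tokens[start:start + WINDOW]
        for concept in concepts:
            c = tokenize(concept)
            # Look for the concept's tokens as a contiguous run inside the scope.
            if any(scope[j:j + len(c)] == c for j in range(len(scope) - len(c) + 1)):
                negated.add(concept)
        i = start
    return negated


def precision_recall(predicted, gold):
    """Compare predicted negation annotations with a gold-standard set."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


sentence = "The specimen shows adenocarcinoma with no evidence of lymphovascular invasion"
found = negated_concepts(sentence, ["adenocarcinoma", "lymphovascular invasion"])
print(found)
```

In this sketch, precision is the fraction of predicted negations that also appear in the gold standard, and recall is the fraction of gold-standard negations that were predicted. These are the quantities the abstract reports as 0.84 and 0.80 for the Final Diagnosis section.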


Similar resources

Extraction of Drug-Drug Interaction from Literature through Detecting Linguistic-based Negation and Clause Dependency

Extracting biomedical relations such as drug-drug interaction (DDI) from text is an important task in biomedical NLP. Due to the large number of complex sentences in biomedical literature, researchers have employed some sentence simplification techniques to improve the performance of the relation extraction methods. However, due to difficulty of the task, there is no noteworthy improvement in t...


Computation Optical Flow Using Pipeline Architecture

Accurate estimation of motion from time-varying imagery has been a popular problem in vision studies. This information can be used in segmentation, 3D motion and shape recovery, target tracking, and other problems in scene analysis and interpretation. We have presented a dynamic image model for estimating image motion from image sequences, and have shown how the solution can be obtained from a ...


Design, Implementation, and Applicability Evaluation of Hip and Knee Arthroplasty Registry

Introduction: Arthroplasty is a major orthopedic operation with an increasing rate. The success of this operation can significantly reduce patients’ pain and disabilities. This study aimed to design a registry system for hip and knee arthroplasties. Method: A comprehensive search was conducted to retrieve minimum data set from articles, guidelines, forms and reports published by orthopedic soci...


Identifying Metastases-related Information from Pathology Reports of Lung Cancer Patients

Metastatic patterns of spread at the time of cancer recurrence are one of the most important prognostic factors in estimation of clinical course and survival of the patient. This information is not easily accessible since it's rarely recorded in a structured format. This paper describes a system for categorization of pathology reports by specimen site and the detection of metastatic status with...



Journal title:
  • Studies in health technology and informatics

Volume 107 Pt 1  Issue

Pages  -

Publication date 2004